On the time variability of 7-ray sources: A numerical analysis of variability indices 
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O , We present a Monte Carlo analysis of the recently introduced variability indices r (Tompkins 1999) and / (Zhang et al. 2000 
■ & Torres et al. 2001) for 7-ray sources. We explore different variability criteria and prove that these two indices, despite 
the very different approaches used to compute them, are statistically correlated (5 to 7a). This conclusion is maintained 
also for the subset of AGNs and high latitude (\b\ > 10 deg) sources, whereas the correlation is lowered for the low latitude 
ones, where the influence of the diffuse galactic emission background is strong. 
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' 1. Introduction 

o 



The study of the time variability of 7-ray sources, particularly using the Third EGRET Catalog (Hartman 1999), 
is currently a very active topic of research. The Third EGRET Catalog includes observations carried out between 
April 22, 1991 and October 3, 1995, and lists 271 point sources. About two thirds of them have no conclusive 



O counterparts at lower frequencies. Even worse, 40 of them do not show any positional coincidence (within the 95% 
1^ ' EGRET contour) with possible 7-ray emitting objects known in our galaxy (Romero et al. 1999). 

' In order to understand the origin of all these unidentified detections, their variability status is of fundamental 
, importance. Several known models for 7-ray sources in our galaxy would produce non-variable sources during the 
■ timescale of observations. That is the case of pulsars (Thompson 2001) or supernova remnants in interaction with 
molecular clouds (Esposito et al. 1996, Combi et al. 1998, 2001). Alternatively, if some of the sources are produced 
by isolated magnetized black holes (Punsly et al. 2000), microquasars (Paredes et al. 2000), or by stellar winds of 
early type stars (Benaglia et al. 2001) one would expect high levels of flux variability. 

Looking at the flux evolution through the different viewing periods is obviously a first indication of the variability 
status of any given source (see for instance Tavani et al. 1997). However, fluxes are usually the result of only a 
handful of incoming photons. A safer way of quantifying the flux evolution should be devised before obtaining 
significant results. 



2. Variability indices 

2.1. The V-index for 7-ray variability 

Three variability indices have been introduced in the literature so far. The first of them, dubbed V, was presented 
by MacLauglin et al. (1996), who computed it for the sources contained in the Second EGRET Catalog. This 
method was later used, also, for a short timescale study by Wallace et al. (2000). The basic idea behind V is to 
find x 2 for the measured fluxes, and to compute V — — logQ, where Q is the probability of obtaining such a \ 2 if 
the source were constant. Several critiques have been mentioned concerning this classification, among them, that 
the scheme gets complicated when the fluxes are just upper limit detections. It can be shown that sources which 
have upper limits included in the analysis will have a lower V than that implied by the data (Tompkins 1999). In 
addition, a source can have a large V because of intrinsic reasons -the case we would be interested in-, or because 
of small error bars on the flux measurements. Similarly, a small value of V can imply a constant flux or big error 
bars. Each value of V is obtained disregarding those of a control population. Then, we can have pulsars with very 
high values of V, or observed AGNs with very low ones. The use of V to classify the variability of 7-ray sources 
seems not to be very confident. 
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2.2. The r-index for 7-ray variability 

Tompkins (1999) introduced a new variability criterion which takes into account not only published EGRET data, 
contained in the point source 3EG Catalog, but also unpublished information. In order to decide the variability 
index for a given source he used also the 145 marginal sources that were detected but not included in the final 
official list, and, all at a time, the detections within 25 deg of the source of interest. The maximum likelihood set 
of source fluxes was then re-computed. From these fluxes, a new statistics measuring the variability was defined as 
t = cr//i, where a is the standard deviation of the fluxes and [i their average value. The strength of this approach 
lies in that it takes into account some possible fluctuations from the background and from neighboring sources, 
careful sensitivity corrections throughout EGRET lifetime, and others systematic errors related either with the 
equipment itself or with the processing of the information, in a similar way to that used in the construction of the 
3EG (Hartman et al. 1999, Tompkins 1999). Details are to be given in Tompkins et al. (2001). The final result of 
Tompkins' analysis is a table listing the name of the EGRET source and three values for r: a mean, a lower, and 
an upper limit (68% error bars). 



2.3. The /-index for 7-ray variability 

This index was previously used in blazar variability analysis (Romero et al. 1994) and applied to some of the 
3EG sources by Zhang et al. (2000) [] and Torres et al. (2001). The basic idea is to do a direct comparison 
of the flux variation of any given source with that shown by pulsars, which is considered spurious. Then, the 
/-index establishes how variable a source is with respect to the pulsar population. Contrary to Tompkins' index, 
the /-scheme uses only the publicly available data of the 3EG Catalog. 

The / index is defined as follows. Firstly, a mean weighted value for the EGRET flux is computed: 
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N vp is the number of single viewing periods for each 7-ray source, F(i) is the observed flux in the « th -period, 
whereas e(i) is the corresponding error in the observed flux. These data are taken directly from the 3EG catalog. 
For those observations in which the significance (y/TS in the EGRET catalog) is greater than 2a, we took the 
error as e(i) — F(i)/yTS. For those observations which are in fact upper bounds on the flux, it is assumed that 
both F(i) and e(i) are half the value of the upper bound. Then, the fluctuation index /i is defined as: 



H = 100 x cr sd x (FY 



(2) 



In this expression, cr s d is the standard deviation of the flux measurements, taking into account the previous 
considerations. 

This fluctuation index is also computed for the confirmed 7-ray pulsars in the 3EG catalog, assuming the 
physical criterion that pulsars are -i.e. by definition- non-variable 7-ray sources. Then, any non-null /x-value for 
pulsars is attributed to experimental uncertainty. Finally, the averaged statistical index of variability, /, is given 
by 
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(3) 



In Fig. 1 we show the histogram of / for 258 7-ray sources in the 3EG Catalog, and in Fig. 2, the sky distribution. 



3. From variability indices to variability criteria 



3.1. Plausible criteria for r 

The index r moves from upwards, and it is considered infinite when it is greater than 10 000. As can be seen 
from Tompkins (1999), the thresholds for variability are diffuse. To have an idea of what a variable source is 
under the r-scheme, Tompkins has separated the 7-ray sources into different classes: pulsars, unidentified, AGNs, 
and sources spatially coincident with SNRs. He found that r can clearly distinguish between pulsars, whose mean 
r- value is 0.1, having the highest upper limit equal to 0.27, and AGNs, whose mean is 0.9. The unidentified sources 
have r-values pertaining to both categories. Many sources clearly have a dubious classification; for instance, 3EG 
J1339-1419 has a mean r-value equal to 0.68, but their lower and upper limits are, respectively, 0.17 and 1.70. 

1 These authors considered as part of the control population sources not recognized as pulsars in the 3EG Catalog. See Torres et al. 
2001 for a discussion. 
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Fig. 1: Variability index distribution for 258 7-ray sources in the Third EGRET Catalog, excluding pulsars and six artifacts 
related with Vela. 
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Fig. 2: Sky variability distribution, under the /-scheme, of 258 7-ray sources in the Third EGRET Catalog. Up triangles 
stand for (25) variable sources, circles for (157) dubious, and a down triangles for (76) non-variable. Thresholds are as in 
Table 1. 



Then, within the 68% error bars on r, this source can be as variable as an AGN, or as non-variable as a pulsar. 
This is an uncomfortably common situation for many sources. 

However, if the lower limit on t is greater than, say, 1.0, the source is very likely variable. If the upper limit on r 
is, on the contrary, compatible with the r-values for pulsars, we would classify it as non-variable. This encompasses 
the spirit of Tompkins' (1999) classification of the most likely variable and the most likely non-variable sources. 
The question is then what the thresholds should be. We have seen that pulsars are consistent with values of r up 
to 0.27. The deviation for the mean value of pulsars is ~ 0.1. Then, it appears safe to consider that a source will 
be likely variable -under the t scheme- when the lower limit on r is at least 0.6, 3cr above the mean value of the r 
upper limit for pulsars. Equivalcntly, a source will be considered non-variable when the upper limit for r is below 
that threshold. Sources not fulfilling either classification should be considered as dubious. 

We can modify these thresholds in a number of ways, but we want pulsars to represent a non- variable population, 
and AGNs to be, on average, a variable one. But even fulfilling these constraints, we could better use a 2 or 4cr level 
as a safe assumption, or pretend to artificially move the threshold to ~ 0.3, just above the highest possible level for 
pulsar variability. We have explored these assumptions case by case, by means of a computer code described below, 
and although we found no statistically strong variations in the final classification, we did find that a threshold of 
0.5-0.6 is the safer. Known variable sources end up classified as variable, known or expected non- variable ones also 
get their right status. 

3.2. Plausible criteria for / 

One possibility for defining a variability criterion for / is also to consider the error bars for each source: 

SI = - — ^ 6 < fi > pu l sa rs~ 0.5 /. (4) 

^ ^ ^pulsars 

Here 6 < fi > pu isars is the deviation from the < /i > pulsars - value. Then, we have just propagated through I the 
error in defining the mean value of the fluctuation index for pulsars. We can then define variable sources as those 
fulfilling the constraint 

I -61 > I p + 3cr, (5) 

and non-variable sources as those having 

I + SI <I p + 3cr. (6) 

Here, I p — 1.0, is the mean value of / for pulsars, and a — 0.5, is the deviation in the pulsar /-values. Again, sources 
not fulfilling neither classification arc to be considered dubious. Then, rephrasing the previous two equations we 
get variable sources when / > 5.0, non- variable sources when / < 1.7, and dubious cases for /-values in between. 
These are very conservative and restrictive constraints, and have close analogy with the proposed ones for the 
r-index. Particularly, notice that if / > 5.0 is the threshold for a variable source, then we are asking for the value 
of / to be 8cr = 8 x 0.5 times above that of pulsars. Similarly, for a source to classify as non-variable, its /-index 
should depart from that of the pulsars in less than 1.4<r. 

This may sound excessive. In addition, why does 3a appear in Eqs. (5) and (6), not 2 or 4? We can as well 
use the mean value of / in a direct way, so defining a more straightforward scale. We can assume a source to be 
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Table 1: Classification of all 258 sources (excluding pulsars and six artifacts related with Vela) reported in the Third EGRET 
Catalog. Pulsars are non-variable sources in both schemes. The r-threshold is equal to 0.6, and the /-thresholds are equal 
to 1.7 and 5.0, respectively. The row dubbed 'Same class' shows the number of sources that classify within the same group 

in both schemes. 

Scheme variable dubious non-variable 

~1 25 157 76 

r 55 139 64 

Same class 17 95 36 



Table 2: Classification of confirmed AGNs reported in the Third EGRET Catalog. Thresholds are as in Table 1. 
Scheme variable dubious non-variable 

7 10 42 15 

r 15 40 12 

Same class 7 30 7 



non-variable if its /-value is less than 1.5 {la = 1 4- 0.5 above the pulsars), dubious for 1.5 < I < 3.5, and variable 
for / > 3.5, 5ct above the mean /-index for pulsars. That would be, although less restrictive, as good a criterion as 
the previous one. Changing the criteria will obviously change the variability status of those sources with values of 
r and / near the boundaries. We then need to explore all these possibilities in a systematic way before extracting 
significant conclusions. 

4. Results of the numerical analysis 

We have written a numerical code that classifies the source variability, given any chosen criteria, both in the / and 
the r scheme. In the Internet address provided below we present complete tables quoting together the / and r 
indices for each of the sources in the 3EG Catalog. We present in Table 1 the results for the classification using 
the above explained criteria: r-threshold equal to 0.6, and /-thresholds equal to 1.7 and 5.0, respectively. There 
are 148 detections out of 258 -five pulsars and six artifacts related with Vela were excluded- (57% of the sources) 
which classify within the same groups both for / and r. Can this percentage be obtained randomly? 

We have simulated thousands of sets of 258 sources and assigned to each of them a random variability index 
/. We preserved the histogram for /, i.e. the number of variable, dubious, and non-variable detections is the 
same in each of the simulated sets, but they are assigned to randomly chosen sources. Should we not preserve the 
histogram for /, we would admit, for instance, a random case in which all sources are variable, another in which all 
are non-variable, etc. This would diminish the random probability of obtaining the real result in an inappropriate 
way. What we want to test is the actual classification which associates a particular source with a particular value 
of /; this is why, while maintaining the / distribution, we shuffle the associations. 

Not only the percentages of equal classification are important in order to decide if the two schemes are sta- 
tistically correlated, but also the expected random result. For instance, if the thresholds are chosen such that all 
sources are non-variable in both schemes, then the percentage of equal classification would be 100%. But so would 
be the random percentage for each of the simulated sets, and then there would be no correlation at all. 

We found that the expected random result is 104.8±6.3, i.e. 7a below the real result, implying for it a Poisson 
probability equal to 8xl0~ 6 . We have also used several alternative plausible thresholds both for / and r, for 
instance, T-thresholds equal to 0.8, 0.5, and 0.35, with /-thresholds equal to 5.0/2.0, 5.0/1.7, and 3.5/1.5. In all 
cases, we obtain a percentage of equal classification above 50%, the worst random result (obtained for T-thresholds 
equal to 0.35 and /-thresholds equal to 3.5/1.5) being still 5a lower than the real one. Thus, disregarding the fine 
grain of the variability criteria, the two schemes are statistically correlated. We have also explored what happens if 
we do not consider those sources having an average recomputed flux equal to 0.00 within the r-scheme (Tompkins 
1999). Doing the simulations excluding these sources produces an even more correlated result. 

In Table 2 we show the results for the 67 AGNs. 44 (65%) of them have the same classification within both 
schemes, while we would expect only 31±3.0 as a result of chance, 5a lower than the real result. Again, changing 
the criteria does not significantly alter the results (and in most cases actually improves them). 

Table 3 shows the results both for high (\b\ > 10 deg) and low latitude sources. For the former, the random 
result is Aa below the real one. Changing the criteria to all other plausible ones we have discussed above enhances 
the correlation. For the low latitude sources, the result is 3tr away from the real one. Here, changing the criteria 
to any other of the plausible ones we mentioned does not generally improve the correlation. The decrease in 
statistical correlation between / and r at low galactic latitudes, for the less restrictive criteria, could be reflecting 
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Table 3: Classification of unidentified sources reported in the Third EGRET Catalog. The upper panel refers to high latitude 
sources and the lower one to low latitude detections. Thresholds are as in Table 1. 



Scheme 


variable 


dubious 


non- variable 


I 


12 


74 


34 


T 


34 


61 


25 


Same class 


8 


39 


15 


I 


3 


41 


27 


T 


6 


38 


27 


Same class 


2 


26 


14 



the uncertainties in the subtraction of the diffuse background emission of the galaxy (Hunter et al. 1997, Strong 
et al. 2000). 

5. Discussion and concluding remarks 

The status of a particular source can vary from one scheme to the other. Then, the joint use of / and r can provide 
a better idea of the variability status of any given source. Particular classifications may disagree as a result of 
completely different techniques for computing the variability indices. Note that the dubious classification of any 
r plausible criterion is applied upon sources we know nothing about. This is not the case for /, since it always 
provides a scale relative to the mean of the pulsar fluctuation indices. Then, it rests on our own judgement to 
decide the weight we shall give to a result like / = 3.0, but it undoubtedly says that the flux evolution is three 
times more variable than the mean flux evolution for pulsars. We mention that in order to get more reliable results 
under the /-scheme at low latitudes it seems safer to consider the most restrictive cutoffs. 

It has been noted (R. Hartman, private communication, 2001) that the weighting used for the definition of / 
(as done in Zhang et al. (2000) and Torres et al. (2001)): an inverse square for the errors in the fluxes, could 
provide values of < F > unrealistically high. The combined use of unweighted averages, and the definitions 
F ~ VTS/ (2 + VrS) and err(F) ~ 1/(2 + yTS) for the the exposures showing only upper limits, could even 
improve the correlation with Tompkins' index. But this would be another index for quantifying variability, not yet 
used in the literature. 
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